Summary of Phase PI Results
Phase  PI  consists  of  two  tasks: 

[T1] Task T1: Analysis and design of finite wordlength implementations of
linear, time-invariant δ-systems.

[T2] Task T3: 2-D and m-D δ-system models.

This project is an extensive collaborative effort with Professor Peter H. Bauer,
Department of Electrical Engineering, University of Notre Dame, Notre Dame, IN 46556, who
is the principal investigator of
Grant  No:  N00014-94-1-0387; 

R&T  Project  Code:  3148509—01. 

This  report,  and  the  research  work  indicated  therein,  were  hence  completed  in  cooperation 
with  him. 

The major part of task T1 was carried out at the University of Notre Dame by Dr. Peter H.
Bauer, while the major part of task T3 was carried out at the University of Miami by
Dr. Kamal Premaratne. Of course, the two principal investigators have been in constant
contact throughout the work.

The  following  is  a  summary  of  the  phase  PI  results. 

Task T1: Analysis and Design of Finite Wordlength Implementations of Linear,
Time-Invariant δ-Systems

The conclusions drawn from the work conducted for task T1 may be summarized as follows:

1. The Fixed-Point Arithmetic Case: When limit cycle performance is crucial, the
q-operator implementation is preferable. The δ-operator implementation is superior
with regard to coefficient sensitivity issues.

2. The Floating-Point Arithmetic Case: Generally, the δ-operator implementation
outperforms its q-operator counterpart. In particular, in high-order and high-speed
applications, the δ-operator implementation is the best choice.
Prior to a more detailed exposition, we first provide a qualitative justification for the
above conclusion. The state equations of a δ-operator system can be written as:

$$
\begin{aligned}
\delta[x](n) &= A_\delta\, x(n) + B_\delta\, u(n); \\
q[x](n) &= x(n) + \Delta \cdot \delta[x](n),
\end{aligned}
\tag{T1.1}
$$

where x and u are the state and input vectors, respectively. Here, Δ denotes a positive real
constant (typically, the sampling time). The symbol δ[·] denotes the δ-operator, that is,

$$
\delta[x](n) = \frac{q[x](n) - x(n)}{\Delta} = \frac{x(n+1) - x(n)}{\Delta}, \tag{T1.2}
$$

and q[·] denotes the usual q-operator, that is,

$$
q[x](n) = x(n+1). \tag{T1.3}
$$

The corresponding formulation of (T1.1) in terms of the q-operator is

$$
q[x](n) = A_q\, x(n) + B_q\, u(n), \tag{T1.4}
$$

where

$$
A_q = I + \Delta \cdot A_\delta \iff A_\delta = \frac{A_q - I}{\Delta}; \qquad
B_q = \Delta \cdot B_\delta \iff B_\delta = \frac{B_q}{\Delta}. \tag{T1.5}
$$
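
As a short check of (T1.5), the intermediate equation of (T1.1) can be substituted into the
update equation. The following worked step uses only the symbols defined above and is added
here for clarity:

```latex
\begin{aligned}
q[x](n) &= x(n) + \Delta \cdot \delta[x](n) \\
        &= x(n) + \Delta \cdot \bigl(A_\delta\, x(n) + B_\delta\, u(n)\bigr) \\
        &= (I + \Delta \cdot A_\delta)\, x(n) + (\Delta \cdot B_\delta)\, u(n).
\end{aligned}
```

Comparing the last line with (T1.4) term by term gives A_q = I + Δ · A_δ and B_q = Δ · B_δ.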

Now, given x and u, both representations compute q[x] with a certain accuracy.
Consider the δ-operator formulation in (T1.1). Here we encounter two errors:

1. The first is due to the computation of δ[x], that is, the first equation in (T1.1). We
will refer to this equation as the intermediate equation.

2. The second is due to the eventual computation of q[x], that is, the second equation
in (T1.1). We will refer to this equation as the update equation.

Let us assume that the total error in computing q[x] is mainly due to the intermediate
equation in (T1.1) (rather than the update equation). Then, by choosing Δ sufficiently
small, the total error in computing q[x] will be approximately the error created by the
update equation, which is small! In this case, the δ-operator representation has better
finite wordlength properties than its q-operator counterpart in (T1.4).

If, however, the errors accumulated in the intermediate and the update equations in
(T1.1) are comparable, q[x] computed through the δ-operator representation will show
approximately the same error as that computed through its q-operator counterpart,
assuming Δ is sufficiently small. If Δ is not sufficiently smaller than one, the δ-operator
representation will actually perform worse than the q-operator representation!


If the error introduced in the update equation is larger than that in the intermediate
equation, the δ-operator representation would consistently perform worse! In reality, this
case is very unlikely to occur.


Next,  a  more  detailed  exposition  follows. 
T1.1. The Fixed-Point Arithmetic Case
We now discuss some of the results regarding the fixed-point (FXP) case. Here, our results
in fact indicate that, in case limit cycle behavior is crucial, the δ-operator representation
is NOT suitable with this arithmetic scheme [1]. Such a case may occur when nonlinear
systems are implemented through FXP δ-operator based schemes.


Zero-input limit cycles. Independent of Δ, zero-input limit cycles cannot be avoided
in FXP δ-implementations. This is easily explained as follows: if Δ is chosen very small,
the contribution from the intermediate equation is small (since δ[x] is multiplied
by Δ), so that, during the update equation, q[x] can be quantized to x, creating a DC limit
cycle; that is, an incorrect equilibrium point different from zero results. We emphasize that
most of the desirable properties of δ-operator implementations are based on a small Δ. We may
also show that, if Δ is chosen larger (this case is of course somewhat less important), DC
limit cycles will still exist. Hence, δ-operator representations cannot be implemented limit
cycle free in FXP format! This fact is independent of the particular realization of the
system.


Deadband size. Since δ-systems cannot be implemented limit cycle free in FXP format,
it is of interest to investigate the size of such limit cycles since, in certain situations,
small limit cycle amplitudes can be tolerated. It can be shown that the magnitude of
Δ determines the magnitude of the limit cycle. The smaller the Δ, the larger will be the
deadband and hence the limit cycle magnitude. An approximate relationship regarding
this is

$$
\Delta \times (\text{size of deadband}) = 1, \tag{T1.6}
$$

where the size of deadband is measured in multiples of the quantization step size. Here,
the deadband corresponds to that obtained by considering the quantization of Δ · δ[x].
Therefore, the usual choice of a small Δ creates a larger deadband!
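
The deadband mechanism can be illustrated with a small simulation. The following Python
sketch is not taken from the report: it assumes a first-order zero-input δ-system with
A_δ = -1, integer-valued states (quantization step of one), Δ = 2^-k, and a magnitude
truncation quantizer applied to the product Δ · δ[x] in the update equation. The function
name settle and all parameter values are hypothetical.

```python
import math

def settle(x0, k, a_delta=-1.0, max_steps=100_000):
    """Iterate x <- x + trunc(Delta * a_delta * x), Delta = 2**-k, until x stops changing.

    The state x is an integer (multiples of the quantization step); math.trunc models
    magnitude truncation of the product Delta * delta[x] in the update equation.
    """
    delta = 2.0 ** -k
    x = x0
    for _ in range(max_steps):
        step = math.trunc(delta * a_delta * x)  # quantized Delta * delta[x]
        if step == 0:                           # update equation returns x unchanged
            return x
        x += step
    raise RuntimeError("state did not settle")

# Smaller Delta (larger k) leaves the state stuck farther from zero.
for k in (2, 4, 6):
    stuck = settle(1000, k)
    print(f"Delta = 2^-{k}: state sticks at {stuck}; "
          f"Delta * (stuck + 1) = {2.0 ** -k * (stuck + 1)}")
```

Under these assumptions the state stops at 1/Δ - 1 quantization steps instead of decaying to
zero, so the width of the stuck region scales as 1/Δ, in line with (T1.6). Other quantizers
(for example, rounding) change the constant but not the trend.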


The input driven case. Although the input driven case is not part of the originally
proposed work, some interesting results have been obtained. For small values of Δ, there
exists a bounded input signal that does not allow control of the state trajectory. In other
words, given sufficiently small Δ, the state trajectory may not be influenced by such an
input signal.
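
A minimal numerical sketch of this effect follows. It is an illustration under stated
assumptions rather than the construction used in the report: a first-order δ-system with
A_δ = -1 and B_δ = 1, integer-valued states, Δ = 2^-k, and magnitude truncation of the
product Δ · δ[x]. The function name run and the input sequence are hypothetical.

```python
import math

def run(u_seq, k, a_delta=-1.0, b_delta=1.0, x0=0):
    """Simulate x <- x + trunc(Delta * (a_delta * x + b_delta * u)) with Delta = 2**-k."""
    delta = 2.0 ** -k
    x, states = x0, []
    for u in u_seq:
        # intermediate equation, then quantized update equation
        x += math.trunc(delta * (a_delta * x + b_delta * u))
        states.append(x)
    return states

# Bounded alternating input with |u| = 40.
u_seq = [((-1) ** n) * 40 for n in range(50)]

small_delta_states = run(u_seq, k=6)   # Delta = 1/64: |Delta * u| < 1 is truncated to zero
large_delta_states = run(u_seq, k=2)   # Delta = 1/4: the input reaches the state

print("Delta = 2^-6, state moved:", any(s != 0 for s in small_delta_states))
print("Delta = 2^-2, state moved:", any(s != 0 for s in large_delta_states))
```

With the smaller Δ every product Δ · δ[x] falls below one quantization step, so the state
never leaves its initial value even though the input is nonzero at every sample.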

The influence of the realization. First, it was necessary to develop a suitable scheme
to investigate the effect of realization on the presence or absence of limit cycles. In this
direction, for the q-operator case, a computer-based exhaustive search algorithm that checks
for limit cycles (DC and/or oscillatory) has been developed [5].
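
The algorithm of [5] is not reproduced in this summary. Purely as an illustration of what an
exhaustive limit cycle search can look like, the following Python sketch enumerates all
integer initial states in a bounded box, iterates a quantized state update x(n+1) = Q(A x(n))
until a state repeats, and records every cycle other than the zero equilibrium. The function
names, the single quantizer applied after each row sum, and the search bound max_amp are
assumptions made for this sketch, not details taken from [5].

```python
import itertools
import math

def round_half_up(v):
    """Rounding quantizer with unit step (ties go toward +infinity)."""
    return math.floor(v + 0.5)

def find_limit_cycles(A, quantize, max_amp, max_steps=10_000):
    """Return the set of zero-input limit cycles reached from the box [-max_amp, max_amp]^n.

    Each cycle is a tuple of integer state tuples, rotated so its smallest state is first.
    """
    n = len(A)
    origin = (0,) * n
    cycles = set()
    for x0 in itertools.product(range(-max_amp, max_amp + 1), repeat=n):
        x, seen, path = x0, {}, []
        while x not in seen and len(path) < max_steps:
            seen[x] = len(path)
            path.append(x)
            # one quantization per state variable, applied after the row sum
            x = tuple(quantize(sum(A[r][c] * x[c] for c in range(n))) for r in range(n))
        if x in seen:                      # trajectory closed on itself
            cycle = path[seen[x]:]
            if cycle != [origin]:          # ignore convergence to zero
                k = cycle.index(min(cycle))
                cycles.add(tuple(cycle[k:] + cycle[:k]))
    return cycles

# Example: first-order system x(n+1) = Q(-0.9 x(n)) with rounding.
for cycle in sorted(find_limit_cycles([[-0.9]], round_half_up, max_amp=3)):
    kind = "DC" if len(cycle) == 1 else f"oscillatory, period {len(cycle)}"
    print(cycle, kind)
```

A recorded cycle of length one is a DC limit cycle; longer cycles are oscillatory. The search
cost grows as (2 · max_amp + 1)^n, so this brute-force form is only practical for low orders
and short wordlengths.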

As discussed before, we have shown that a stable linear time-invariant δ-system cannot
be implemented limit cycle free in FXP. The size of the deadband, however, also depends on
the particular realization, that is, the structure of A_δ. Given a system transfer function,
there are forms which minimize this deadband size with respect to some appropriately
chosen measure. For example, in order to minimize DC limit cycle amplitude, one may
choose the normal form (in terms of A_q) as a suitable candidate.

The influence of quantization nonlinearity and its deadzone. Since a larger deadzone
implies larger DC limit cycle amplitudes, the use of quantizers with reduced, or even
zero, deadzone was proposed. In investigating first-order systems, it was found that
reducing the deadzone can indeed reduce the occurrence of DC limit cycles.
Unfortunately, other oscillatory limit cycles will be created. This phenomenon is due to
the increased gain exhibited towards small input signals by the quantizer.

Scaling. As discussed above, we have shown that, independent of either the form of
A_q or the magnitude of Δ, a FXP implemented δ-system cannot be free of zero-input limit
cycles. Hence, scaling cannot be offered as a possible solution.

T1.2.  The  Floating-Point  Arithmetic  Case 

The floating-point (FLP) implementation of δ-systems is currently under investigation.
The results obtained so far are very encouraging, and indicate that quantization errors
due to FLP arithmetic have a much smaller effect on the system behavior than in the FXP
case. In fact, preliminary results show that, for δ-systems of order three and higher, errors
in computing q[x] can be made significantly smaller than for the corresponding q-systems.
This is because, for a FLP implementation of such a system, errors created through the
intermediate equation are larger than those created through the update equation. As
previously mentioned, in this situation, δ-systems behave better than their q-operator
counterparts!
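
One way to examine such a comparison empirically is sketched below in Python with NumPy.
This is not the analysis used in the report: it simply runs the q-form (T1.4) and the
δ-form (T1.1) of the same single-input system in reduced precision and measures the
deviation of each from a double-precision reference. The function names, the use of
np.float32 as the short wordlength, and the second-order example system are assumptions;
higher-order matrices can be substituted directly.

```python
import numpy as np

def simulate(A, B, u, delta=None, dtype=np.float64):
    """Return the state trajectory in the given precision.

    delta=None runs the q-form x(n+1) = A x(n) + B u(n); otherwise A and B are the
    delta-form matrices and each step evaluates the intermediate and update equations.
    """
    A, B = A.astype(dtype), B.astype(dtype)
    x = np.zeros(A.shape[0], dtype=dtype)
    traj = []
    for un in u.astype(dtype):
        if delta is None:
            x = A @ x + B * un
        else:
            d = A @ x + B * un            # intermediate equation
            x = x + dtype(delta) * d      # update equation
        traj.append(x.astype(np.float64))
    return np.array(traj)

def compare(A_q, B_q, delta, u, dtype=np.float32):
    """Maximum state error of the q-form and delta-form against a float64 q-form run."""
    n = A_q.shape[0]
    A_d = (A_q - np.eye(n)) / delta       # (T1.5)
    B_d = B_q / delta
    ref = simulate(A_q, B_q, u)
    err_q = np.max(np.abs(simulate(A_q, B_q, u, dtype=dtype) - ref))
    err_d = np.max(np.abs(simulate(A_d, B_d, u, delta=delta, dtype=dtype) - ref))
    return err_q, err_d

# Hypothetical lightly damped example with poles close to z = 1.
delta = 0.01
A_q = np.array([[0.99, 0.01], [-0.01, 0.99]])
B_q = np.array([0.0, 0.01])
u = np.ones(2000)
err_q, err_d = compare(A_q, B_q, delta, u)
print("q-form error:", err_q, " delta-form error:", err_d)
```

The outcome depends on the system order, the pole locations, Δ, and the wordlength, so a
single run of this kind should be read as an experiment, not as confirmation of the general
statement above.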


Limit cycles. In FLP arithmetic, a linearly stable time-invariant system, under zero-input
conditions, may exhibit four types of responses: a diverging response, an oscillatory
periodic response of arbitrary magnitude, an oscillatory periodic response in underflow,
or an asymptotically stable response. Only the last two response types are acceptable in
practice. It is well known that the last response type is in fact a very stringent requirement
and is often not required in practice. Results obtained so far show that, when the
requirements for a response in underflow are compared, the δ-system requires less wordlength
than its q-system counterpart! This advantage in fact grows with the order of the system!

Once the system reaches underflow conditions, the δ-system again exhibits DC limit
cycles. However, if the exponent register is chosen sufficiently large, the amplitude of these
oscillations can be made extremely small and hence, for all practical purposes, this problem
is solved.

Deadband  size.  If  the  condition  on  the  mantissa  length  that  guarantees  convergence 
into  underflow  is  satisfied,  then  the  deadband  size  will  be  very  small.  Hence,  it  can  be 
neglected  for  all  practical  purposes.  This  assumes  a  properly  chosen  exponent  register 
length  since  the  exponent  register  length  determines  the  dynamic  range  of  underflow. 

The influence of the nonlinearity. Unlike the FXP case, the characteristic of the
nonlinearity has only a minor effect on the system behavior, significant differences being
present only in underflow conditions.

The underflow case. In underflow, the δ-system seems to behave worse than its
q-operator counterpart. This is mainly due to the fact that a FLP system in underflow
essentially performs very similarly to a FXP system. However, as mentioned above, if the
dynamic range of underflow is chosen properly, the system behavior in underflow is of little
practical interest.

Block Floating-Point Arithmetic. Even for the q-operator case, results regarding block
FLP implementations are lacking. Hence, the investigation of block FLP implementations
of δ-systems is in its early stages. In order to obtain a comparison between the two
types of implementations, current research is geared towards obtaining results applicable
to the q-operator case.
T1.3. The Multi-Dimensional Case

The results on one-dimensional (1-D) δ-operator implementations in FXP arithmetic
directly carry over to the multi-dimensional (m-D) case. The existence of non-converging
responses along the boundary of the causality region can easily be proven using the same
type of argument used in the 1-D case. Consequently, δ-operator based implementations
of m-D systems cannot be implemented limit cycle free in FXP.

Task T3: 2-D and m-D δ-System Models

Discrete-time systems implemented using the δ-operator, as is clear from the discussion
above, exhibit superior finite wordlength properties with FLP arithmetic. In the case of
FXP arithmetic, they still provide superior coefficient sensitivity. The development of 2-D
and m-D models applicable for δ-operator implementations was hence motivated with the
expectation that these properties would still hold true.

The conclusions drawn from the work conducted for task T3 may be summarized as
follows: Similar to the 1-D case, under FLP arithmetic, the δ-operator implementation of
2-D and m-D discrete-time systems provides the best choice. Again, this is particularly
true in high-order and high-speed applications.

State-space models. The Roesser local s.s. model of q-operator formulated 2-D
discrete-time systems takes the form


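$$
\begin{aligned}
\begin{bmatrix} q_h[x^h](i,j) \\ q_v[x^v](i,j) \end{bmatrix}
&= \begin{bmatrix} A_q^{(1)} & A_q^{(2)} \\ A_q^{(3)} & A_q^{(4)} \end{bmatrix}
   \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix}
 + \begin{bmatrix} B_q^{(1)} \\ B_q^{(2)} \end{bmatrix} u(i,j)
 = [A_q] \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [B_q]\, u(i,j); \\
y(i,j)
&= \begin{bmatrix} C_q^{(1)} & C_q^{(2)} \end{bmatrix}
   \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [D_q]\, u(i,j)
 = [C_q] \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [D_q]\, u(i,j),
\end{aligned}
\tag{T3.1}
$$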


where A_q^{(1)} is of size n_h × n_h, A_q^{(2)} is of size n_h × n_v, etc. Also, q_h[·] and
q_v[·] denote the horizontal and vertical shift operators, that is,

$$
q_h[x](i,j) = x(i+1,j) \quad \text{and} \quad q_v[x](i,j) = x(i,j+1). \tag{T3.2}
$$


To exploit the advantages of δ-operator implementations, analogous to the 1-D case,
we define the operators

$$
\begin{aligned}
\delta_h[x](i,j) &= \frac{x(i+1,j) - x(i,j)}{\Delta_h} = \frac{q_h[x](i,j) - x(i,j)}{\Delta_h}; \\
\delta_v[x](i,j) &= \frac{x(i,j+1) - x(i,j)}{\Delta_v} = \frac{q_v[x](i,j) - x(i,j)}{\Delta_v},
\end{aligned}
\tag{T3.3}
$$

where Δ_h and Δ_v are two positive real constants. The corresponding δ-operator s.s. model
may then be obtained as


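$$
\begin{aligned}
\begin{bmatrix} \delta_h[x^h](i,j) \\ \delta_v[x^v](i,j) \end{bmatrix}
&= \begin{bmatrix} A_\delta^{(1)} & A_\delta^{(2)} \\ A_\delta^{(3)} & A_\delta^{(4)} \end{bmatrix}
   \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix}
 + \begin{bmatrix} B_\delta^{(1)} \\ B_\delta^{(2)} \end{bmatrix} u(i,j)
 = [A_\delta] \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [B_\delta]\, u(i,j); \\
y(i,j)
&= \begin{bmatrix} C_\delta^{(1)} & C_\delta^{(2)} \end{bmatrix}
   \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [D_\delta]\, u(i,j)
 = [C_\delta] \begin{bmatrix} x^h(i,j) \\ x^v(i,j) \end{bmatrix} + [D_\delta]\, u(i,j).
\end{aligned}
\tag{T3.4}
$$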


This is the 2-D version of the intermediate equation mentioned earlier. In addition, as for
the 1-D case, we have the following update equations:


$$
\begin{aligned}
q_h[x^h](i,j) &= x^h(i,j) + \Delta_h \cdot \delta_h[x^h](i,j); \\
q_v[x^v](i,j) &= x^v(i,j) + \Delta_v \cdot \delta_v[x^v](i,j).
\end{aligned}
\tag{T3.5}
$$


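Note that,

$$
\begin{aligned}
A_q = I + \Delta \cdot A_\delta &\iff A_\delta = \Delta^{-1} \cdot (A_q - I); \\
B_q = \Delta \cdot B_\delta &\iff B_\delta = \Delta^{-1} \cdot B_q; \\
C_q = C_\delta &\iff C_\delta = C_q; \\
D_q = D_\delta &\iff D_\delta = D_q.
\end{aligned}
\tag{T3.6}
$$

Here, Δ = [Δ_h I_{n_h} ⊕ Δ_v I_{n_v}] is of size (n_h + n_v) × (n_h + n_v).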
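
The conversion in (T3.6) is straightforward to mechanize. The following Python/NumPy sketch
is an illustration only: the function names are hypothetical, Δ is built as the block
diagonal matrix described above, and the C and D matrices are omitted because they are
unchanged between the two models. The last function evaluates one local step of the δ-model
through the intermediate equation (T3.4) followed by the update equations (T3.5).

```python
import numpy as np

def delta_matrix(n_h, n_v, delta_h, delta_v):
    """Block diagonal Delta = diag(delta_h * I_{n_h}, delta_v * I_{n_v})."""
    return np.diag(np.concatenate([np.full(n_h, delta_h), np.full(n_v, delta_v)]))

def q_to_delta(A_q, B_q, Delta):
    """(T3.6): A_delta = Delta^{-1} (A_q - I), B_delta = Delta^{-1} B_q."""
    Delta_inv = np.linalg.inv(Delta)
    return Delta_inv @ (A_q - np.eye(A_q.shape[0])), Delta_inv @ B_q

def delta_to_q(A_d, B_d, Delta):
    """(T3.6): A_q = I + Delta A_delta, B_q = Delta B_delta."""
    return np.eye(A_d.shape[0]) + Delta @ A_d, Delta @ B_d

def shifted_states(A_d, B_d, Delta, x, u):
    """Given x = [x^h(i,j); x^v(i,j)], return [x^h(i+1,j); x^v(i,j+1)]."""
    d = A_d @ x + B_d @ u      # intermediate equation (T3.4)
    return x + Delta @ d       # update equations (T3.5)

# Hypothetical example with n_h = 2 horizontal and n_v = 3 vertical states.
rng = np.random.default_rng(0)
n_h, n_v = 2, 3
Delta = delta_matrix(n_h, n_v, delta_h=0.1, delta_v=0.05)
A_q = rng.standard_normal((n_h + n_v, n_h + n_v))
B_q = rng.standard_normal((n_h + n_v, 1))
A_d, B_d = q_to_delta(A_q, B_q, Delta)

x = rng.standard_normal(n_h + n_v)
u = rng.standard_normal(1)
print(np.allclose(shifted_states(A_d, B_d, Delta, x, u), A_q @ x + B_q @ u))
```

In exact arithmetic both models produce the same shifted states; the two forms differ only in
how finite wordlength errors enter the computation.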

The  associated  system  theoretic  notions,  such  as,  transition  matrix,  transfer  function, 
characteristic  equation,  etc.,  have  also  been  introduced.  This  s.s.  model  is  the  basis  for 
designing  2-D  filters  with  superior  finite  wordlength  properties.  The  design  procedures 
developed  are  expected  to  be  extremely  useful  in  obtaining  high-Q  2-D  and  m-D  digital 
filters  that  are  suitable  for  high-speed  applications. 

Stability. In the 1-D case, it has been shown that direct techniques with no recourse
to transformations (that first convert a given δ-system to its q-system counterpart) can
provide numerically more reliable stability checking algorithms. With this in mind, for the
2-D case, a direct stability checking technique applicable to the corresponding δ-system
transfer function has been introduced. For this purpose, a recently developed tabular form
was extended to the complex coefficient case and the notion of Schur-Cohn minors was
introduced to the δ-operator case.

Gramians and balanced realization. The notions of reachability and observability
gramians and balanced realization have been introduced for the δ-operator case. In order
to do this, first, the relationship between the gramians for the δ- and q-operator cases, as
defined in the literature, was established. The reachability and observability gramians,
that is, P and Q, respectively, for 1-D δ-systems were found to satisfy

where T_δ is the stability boundary applicable for δ-systems, that is, T_δ = {c ∈ ℂ : |c +
1/Δ| = 1/Δ}. An extension of this is then used to define the 2-D gramians of δ-systems
represented in the Roesser model developed above.
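
The gramian relations of the report are not reproduced in this summary, and the sketch below
does not attempt to restore them. As a separate consistency check, it assumes the standard
q-operator reachability gramian, that is, the solution P of the discrete Lyapunov equation
A_q P A_q^T - P + B_q B_q^T = 0 for a stable real system, and verifies numerically that
substituting (T1.5) turns this equation into
A_δ P + P A_δ^T + Δ A_δ P A_δ^T + Δ B_δ B_δ^T = 0. The function names and the example system
are hypothetical.

```python
import numpy as np

def q_reachability_gramian(A_q, B_q):
    """Solve A_q P A_q^T - P + B_q B_q^T = 0 via the Kronecker (vectorized) form."""
    n = A_q.shape[0]
    rhs = (B_q @ B_q.T).reshape(-1)
    vec_p = np.linalg.solve(np.eye(n * n) - np.kron(A_q, A_q), rhs)
    return vec_p.reshape(n, n)

def delta_form_residual(P, A_d, B_d, delta):
    """Left-hand side of the same Lyapunov equation rewritten in delta-parameters."""
    return A_d @ P + P @ A_d.T + delta * (A_d @ P @ A_d.T) + delta * (B_d @ B_d.T)

# Hypothetical stable second-order example.
delta = 0.1
A_q = np.array([[0.9, 0.1], [0.0, 0.8]])
B_q = np.array([[1.0], [1.0]])
A_d = (A_q - np.eye(2)) / delta      # (T1.5)
B_d = B_q / delta

P = q_reachability_gramian(A_q, B_q)
print(np.max(np.abs(delta_form_residual(P, A_d, B_d, delta))))
```

The residual is zero up to rounding error, which shows that a Lyapunov equation can be posed
directly in terms of A_δ and B_δ without first converting the system to its q-operator form.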

For the important class of separable (that is, separable-in-denominator) systems, it
is shown that these gramians may be computed through the solution of four Lyapunov
equations. These notions and results are useful in many applications, such as in extracting
reduced order models of δ-systems.

Sensitivity. Measures that indicate coefficient sensitivity of the δ-models developed
above have been introduced. Unlike what is available in the literature, this development is
applicable to the MIMO case as well. With these sensitivity measures as a guide, the
development of minimum sensitivity structures has been carried out. The connection with the
corresponding balanced realizations has been pointed out.

Roundoff  noise.  With  the  use  of  a  noise  model  that  takes  into  account  the  roundoff 
error  propagation  in  the  s.s.  model  developed  above,  structures  that  minimize  roundoff 
noise  have  been  developed. 